Abstract. The application of the linked data principles to promote the
availability of open interlinked datasets has resulted in the creation of
the so-called Linked Open Data (LOD) cloud, consisting of 338 datasets, of
which 51 are from the geographic domain whilst the rest are from
non-geographic domains. However, integrating linked open data directly into
open source web mapping is presently a challenging task, since linked data
sources lie outside the open source web mapping environment. Moreover, web
map servers cannot consume RDF data directly. Our current research is aimed
at finding novel ways of visualising linked data in the form of thematic
maps over the internet on-the-fly. In this paper, we present the results of
experiments with integrating non-spatial linked open data into an open
source web mapping environment and visualising them as thematic maps. We
show experiments of web thematic maps created with choropleth and
proportional symbol techniques using our geospatial thematic web service
based on open source web mapping technology. Our results show that it is
possible to integrate non-spatial linked open data with traditional
geospatial data using open source web mapping technology. However, access,
data conversion and data integration are some of the main challenges in
creating web thematic maps on-the-fly with existing web mapping tools from
traditional geospatial data integrated with non-spatial linked open data.

1. Introduction

Linked data enables links to be set between items in different data sources
and therefore connects these sources into a single global data space (Heath
& Bizer 2010). Linked data is open and accessible, and its data
representations are standardized (Heath & Bizer 2010). The creation of a web
of Linked Open Data (LOD), aimed at promoting the availability of open
interlinked datasets, has resulted in the creation of the so-called LOD
cloud. This presents an advantage to data consumers since data and
information schemas become available and accessible on the Web (Fensel
2011). Presently the LOD Cloud 1 is made up of 338 datasets, of which 51 are
from the geographic domain whilst the rest are from non-geographic domains.

There is a growing demand for thematic information for a multitude of
applications, but thematic data sets are highly heterogeneous in syntax,
structure and semantics (Durbha et al. 2009). Producing thematic maps over
the Internet and the WWW could therefore take advantage of the availability
of linked open data in the LOD cloud. However, integrating non-spatial
linked data into open source web mapping is presently a challenging task.
Our current research is aimed at finding novel ways of dynamically
visualising linked open data in the form of thematic maps. One of our goals
was to develop a geospatial web service dedicated to producing thematic maps
with non-spatial linked data. We call it a geospatial thematic web service.
In this paper, our objective is to present results of experiments with the
implementation of this geospatial thematic web service, which uses open
source software and tools to integrate non-spatial linked open data with
geospatial data on a web server to produce web thematic maps.

We show experiments with maps created using choropleth and proportional
symbol mapping techniques using our GeoServer-powered, WMS-based geospatial
thematic web service, which creates thematic maps by integrating non-spatial
linked open data. The latter were accessed via DBpedia's SPARQL endpoint and
integrated into a PostGIS spatial database via SQL script. Our results show
that access, data conversion and data integration are some of the main
challenges in creating web thematic maps with existing web mapping tools
from traditional geospatial data integrated with non-spatial linked open
data. We aim to automate this process in the next stage of our research so
that, from a single client request, thematic maps are created on-the-fly:
consuming linked data, applying styling, publishing data to the web service
and displaying the thematic map to the client. We review concepts and
related work in section 2 and section 3 respectively. We discuss our
research approach in section 4 with the design and implementation of our
geospatial thematic web service. We present our results in section 5.
Discussion, conclusions and future work are presented in section 6,
section 7 and section 8 respectively.

1 Linking Open Data Cloud: http://datahub.io/group/lodcloud?q=. Accessed 22 March 2013



2. Review of Concepts 

In this section we briefly introduce the main concepts related to our work,
namely linked data, thematic maps, open source web mapping and geospa- 
tial web services. 

2.1. Linked Data 

Berners-Lee (2006) outlines architectural principles of linked data which
have been adopted by an increasing number of data providers, leading to 
the creation of a global data space containing billions of assertions - the 
Web of Data (Bizer et al. 2009(a)). 

The creation of a web of linked open data is promoted by the Linking Open
Data community project 2, which aims to promote the availability of open
interlinked datasets and has resulted in the creation of the so-called LOD
cloud. The LOD cloud diagram by Richard & Jentzsch (2011) is shown in
Figure 1. The nodes are the linked datasets and the arrows show interlinks
to other datasets in the cloud.

Information about resources on the Web is represented in RDF (Breitman 
et al. 2010, Fensel 2011). RDF makes it possible to write statements about 
resources with each statement consisting of a subject, predicate and object 
forming a triple. Several triples form a graph.
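
To make the triple structure concrete, the following minimal sketch stores a few statements as (subject, predicate, object) tuples and indexes them by subject, so that the set of triples can be read as a graph. The resource URIs and the helper name `build_graph` are illustrative assumptions and are not taken from the experiments reported in this paper.

```python
# Each RDF statement is a (subject, predicate, object) triple.
# The URIs below are illustrative examples only.
triples = [
    ("http://dbpedia.org/resource/Botswana",
     "http://www.w3.org/1999/02/22-rdf-syntax-ns#type",
     "http://dbpedia.org/ontology/Country"),
    ("http://dbpedia.org/resource/Botswana",
     "http://dbpedia.org/ontology/capital",
     "http://dbpedia.org/resource/Gaborone"),
    ("http://dbpedia.org/resource/Gaborone",
     "http://www.w3.org/2000/01/rdf-schema#label",
     "Gaborone"),
]


def build_graph(statements):
    """Index triples by subject: subject -> list of (predicate, object) edges."""
    graph = {}
    for subject, predicate, obj in statements:
        graph.setdefault(subject, []).append((predicate, obj))
    return graph


graph = build_graph(triples)
for node, edges in graph.items():
    print(node, "has", len(edges), "outgoing edge(s)")
```

The object of one triple (here the capital) can be the subject of another, which is what connects individual statements into a graph.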

Prud'hommeaux & Seaborne (2008) define SPARQL as the query language for RDF.
Some linked data providers provide an RDF dump or a SPARQL endpoint for
their linked datasets. An RDF dump, usually a large RDF document, contains
the RDF graph which makes up the entire linked dataset, whereas a SPARQL
endpoint is an HTTP-based query service that executes SPARQL queries over
the linked dataset (Hartig & Langegger 2010).
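
As a minimal sketch of what "HTTP-based query service" means in practice, the snippet below builds (but does not send) a GET request that carries a SPARQL query in the `query` parameter, as the SPARQL protocol prescribes. The endpoint address is assumed to be DBpedia's public one, the query is a generic placeholder, and `build_sparql_request` is a hypothetical helper name.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

# Assumed address of DBpedia's public SPARQL endpoint.
ENDPOINT = "http://dbpedia.org/sparql"

# Placeholder query: any five triples from the dataset.
QUERY = """
SELECT ?s ?p ?o
WHERE { ?s ?p ?o }
LIMIT 5
"""


def build_sparql_request(endpoint, query):
    """Return an HTTP GET request carrying the query in the `query` parameter."""
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for the SPARQL JSON results format through content negotiation.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_sparql_request(ENDPOINT, QUERY)
print(request.get_method(), urlsplit(request.full_url).path)
# urllib.request.urlopen(request) would send it; omitted so the sketch runs offline.
```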

The geospatial thematic web service that we present in this paper consumes 
non-spatial linked data in RDF via a SPARQL end point. 



2 http://esw.w3.org/topic/SweoIG/TaskForce/CommunityProjects/LinkingOpenData.
Accessed 22 March 2013



2.2. Thematic Maps 

Cartography is the application of art, science and technology to make maps
(Cartwright et al. 2009). Thematic cartography is a branch of cartography
that deals with the production of thematic maps (Slocum et al. 2010).
Thematic maps normally feature only a single distribution or relationship
over a spatial background to help locate the distribution being mapped
(Tyner 2010).

Figure 1. LOD Cloud diagram and extract of the LOD Cloud diagram (Source:
Richard & Jentzsch (2011))



Herzog (2003) observed that thematic cartography could more fully utilize
the potential of the Web to communicate spatial information to a larger
audience and concluded that there are many possible applications of thematic
mapping in the context of the Web.

Choropleth and proportional symbol mapping are two of several cartographic
techniques used in creating thematic maps (Slocum et al. 2010). Choropleth
maps are constructed by grouping data for enumeration units (e.g. countries,
states, districts) into classes and assigning either a colour or a gray tone
to each class. Proportional symbol maps are created with symbols scaled in
proportion to the magnitude of data arising from a particular point. In this
paper we present results of experiments with web maps created using
choropleth and proportional symbol techniques.
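
The grouping step behind a choropleth map can be sketched as follows. The example uses equal-interval classification, which is only one of several possible schemes and is an assumption here, since the classification method is not prescribed by the technique itself. The unit names, values and colours are placeholders.

```python
from bisect import bisect_right


def equal_interval_breaks(values, n_classes):
    """Return the n_classes - 1 internal break values of equal-width classes."""
    lowest, highest = min(values), max(values)
    width = (highest - lowest) / n_classes
    return [lowest + width * i for i in range(1, n_classes)]


def classify(value, breaks):
    """Return the 0-based class index of value for the given break values."""
    return bisect_right(breaks, value)


# Placeholder attribute values per enumeration unit (not real statistics).
densities = {"Unit A": 2.0, "Unit B": 50.0, "Unit C": 98.0, "Unit D": 146.0}
colours = ["#fee8c8", "#fdbb84", "#e34a33"]  # light to dark, one per class

breaks = equal_interval_breaks(list(densities.values()), len(colours))
fill = {unit: colours[classify(v, breaks)] for unit, v in densities.items()}
print(breaks)
print(fill)
```

A value that falls exactly on a break is placed in the higher class; other conventions are equally valid as long as they are applied consistently.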

2.3. Open Source Web Mapping and OGC Web Services 

A geospatial web service is a specialised type of web service that processes 
geospatial data. Geospatial web services are a convenient means of provid- 
ing access to the large volume of geospatial data over the web (Breitman et 
al. 2010). In this paper, we refer to open source web mapping in the context 
of using open source software and tools to create geospatial web services.

The International Organisation for Standardisation's (ISO) technical
committee, ISO/TC 211 Geographic Information/Geomatics 3, develops standards
for information concerning objects or phenomena that are directly or
indirectly associated with a location relative to the Earth (Kresse & Fadaie
2010, Coetzee 2011). This includes geospatial web services. WMS is jointly
published by ISO and the Open Geospatial Consortium (OGC) as ISO 19128:2005,
Web Map Server interface and OpenGIS Web Map Service (WMS) Implementation
Specification (ISO 2005). It standardises the way maps are requested over
the Internet and how servers describe their data holdings (Kresse & Fadaie
2010). A WMS is defined by De la Beaujardiere (2006) as a service that
produces maps of spatially referenced data dynamically from geographic
information. Another OGC standard, the Styled Layer Descriptor (SLD),
specifies how a WMS can be extended to allow user-defined styling (Lupp
2007).

Our geospatial thematic web service is a WMS that uses SLD to create and 
present thematic maps. 
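
To illustrate how a map is requested from a WMS, the sketch below assembles a GetMap URL using the standard WMS 1.3.0 request parameters. The server address, the choice of CRS:84 and PNG output, and the helper name `getmap_url` are assumptions made for illustration; an empty STYLES value asks the server for the layer's default style.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def getmap_url(base_url, layer, style, bbox, width, height):
    """Build a WMS 1.3.0 GetMap URL for one layer."""
    params = {
        "SERVICE": "WMS",
        "VERSION": "1.3.0",
        "REQUEST": "GetMap",
        "LAYERS": layer,
        "STYLES": style,          # empty string selects the default style
        "CRS": "CRS:84",          # longitude/latitude axis order
        "BBOX": ",".join(str(v) for v in bbox),
        "WIDTH": width,
        "HEIGHT": height,
        "FORMAT": "image/png",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical local GeoServer instance; the layer name follows section 5.
url = getmap_url("http://localhost:8080/geoserver/wms",
                 "WorldCountries_LandLocked", "",
                 (-180, -90, 180, 90), 800, 400)
params = parse_qs(urlsplit(url).query, keep_blank_values=True)
print(url)
```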



3 ISO/TC 211: http://www.isotc211.org. Accessed 22 March 2013



3. Related Work 



Our geospatial thematic web service combines geographic information on a
database server with non-spatial linked data and styling (SLD) to produce
thematic web maps, as shown in Figure 2. In its current implementation, the
GeoServer-powered geospatial thematic web service combines geospatial data
in PostGIS with non-spatial linked data from DBpedia. Our geospatial
thematic web service is needed in cases where: 1) an existing WMS has to be
migrated to using linked data, 2) statistical agencies publish statistical
data as linked data to the LOD cloud, and 3) geospatial data is too big to
be accessed over the web and/or is already available locally. Moreover, it
makes the attribute data available on the web usable for creating thematic
maps. In this section we present examples of efforts in representing and
querying linked data in the geospatial context and explain how they are
related to our work. We also highlight DBpedia as the source of non-spatial
linked data for our geospatial thematic web service.

Figure 2. Spatial, non-spatial (attributes) and styling components of the
geospatial thematic web service.



3.1. Representing Geospatial Linked Data 

The W3C Geospatial Incubator Group (GeoXG) 4 published Geospatial
Vocabulary, a report that defines a basic ontology and OWL vocabulary for
representing geospatial properties for Web resources (Lieberman et al.
2007(a)). This report presents a model for basic feature properties of Web
resources and a realization of these feature property elements as XML and
OWL/RDF vocabularies. Geospatial Ontologies, another report by Lieberman et
al. (2007(b)), provides a description of geospatial foundation ontologies
that can be used to represent geospatial concepts and properties on the
World Wide Web. GeoRDF 5 is an RDF-compatible profile for geospatial
information. It defines profiles for points, lines and polygons. Our
geospatial thematic web service in its current implementation consumes
non-spatial linked data (attribute data) from the LOD cloud. In the next
phase of our research we plan to integrate geospatial linked data modelled
according to the Geospatial Vocabulary, Geospatial Ontologies and GeoRDF
into our geospatial thematic web service.

3.2. Querying Geospatial Linked Data 

Recently, OGC has published a standard, OGC GeoSPARQL which supports 
representing and querying geospatial data on the Semantic Web (Perry 
2012). GeoSPARQL defines a vocabulary for representing geospatial data in 
RDF. It also defines an extension to the SPARQL query language for pro- 
cessing geospatial data. We do not need GeoSPARQL for the geospatial 
thematic web service in its current implementation because we are using 
only non-spatial attribute data from the LOD cloud. 

3.3. DBpedia 

DBpedia 6 is a crowd-sourced community effort aimed at extracting structured
data from Wikipedia. The DBpedia knowledge base consists of 1.89 billion
pieces of information (RDF triples), of which 400 million were extracted
from the English edition of Wikipedia and 1.46 billion were extracted from
other language editions. An increasing number of linked data providers have
set data-level links to DBpedia resources, making DBpedia a central
interlinking nucleus for the LOD cloud (Bizer et al. 2009(b)). Our
geospatial thematic web service consumes non-spatial linked data (RDF) from
DBpedia and creates web thematic maps.

4 www.w3.org/2005/Incubator/geo/. Accessed 22 March 2013

5 www.w3.org/wiki/GeoRDF#Implementation. Accessed 22 March 2013

6 http://wiki.dbpedia.org/DBpediaMobile. Accessed 22 March 2013

3.4. map4rdf 

The GeoLinkedData Initiative developed map4rdf 7, a mapping and faceted
browsing tool which can be configured with a SPARQL endpoint to provide
exploration and visualisation of RDF resources enhanced with geometric
information. map4rdf provides geospatial and geometrical visualisation using
Google Maps and OpenStreetMap. map4rdf provides visualization tools for a
single source of geospatial RDF data, while our geospatial thematic web
service links non-spatial linked data with geospatial data to produce
thematic maps.

3.5. Summary 

Integrating non-spatial linked data with geospatial data for web-based
thematic mapping has not received much attention to date. We follow this
novel approach of integrating non-spatial linked data in an open source web
environment to produce thematic maps.



4. Creating a Geospatial Thematic Web Service 

In this section we show the design and implementation of our geospatial web
service that consumes non-spatial linked data from the LOD cloud.

4.1. Selecting State-of-the-art Open Source Tools 

The following software packages were selected based on our previous re- 
search work (Owusu-Banahene & Coetzee 2012) and review of literature: 
GeoServer, PostgreSQL and PostGIS. GeoServer was required to create a 
WMS, to provide support for styling through SLD and SLD extensions, and 
to create a direct connection to the PostGIS database. PostGIS is a spatial 
extension to the PostgreSQL database management system.

4.2. Design and Implementation of the Geospatial Thematic 
Web Service 

Our design of the geospatial thematic web service consisted of two main
components: the web mapping environment and the linked open data
access-and-integration mechanism. Figure 3 is a high-level architecture
showing the main components of the geospatial thematic web service. The
linked open data access-and-integration mechanism is shown in red. GeoServer
with the PostgreSQL and PostGIS spatial database provided the open source
web mapping environment and support for WMS and SLD. Requests for thematic
maps were made via a client such as a web browser. A map request was handled
by the WMS, which returned a thematic map based on the data published to it
from the spatial database and the styles (classification and symbolisation
via SLD) associated with the data. The SPARQL endpoint exposed linked open
data, which were queried and stored temporarily as CSV files. SQL scripts
were the mechanism through which linked data were fed into the web service
environment.

7 http://oegdev.dia.fi.upm.es/projects/map4rdf/. Accessed 22 March 2013




Figure 3. High level architecture of the geospatial thematic web service. 



5. Producing Thematic Maps from Linked Data 

In this section we present results of experiments with thematic maps created
using non-spatial linked data from DBpedia 8, which is the nucleus of the
LOD cloud. We show the resulting thematic maps for two different thematic
mapping techniques: choropleth and proportional symbols.

5.1. Accessing Linked Open Data 

SPARQL queries were executed against DBpedia's SPARQL endpoint to retrieve
the names and population density per square kilometre of all landlocked
countries. The results of the SPARQL query were stored in CSV format to
allow for integration into the PostgreSQL database via SQL script.

5.2. Integrating Linked Data into PostGIS and Publishing to 



A spatial database in PostGIS called LOD with a table (World_Countries)
containing the names of countries and their geometric data was created.
Figure 4 is a GetMap response from the WMS showing the
WorldCountries_LandLocked layer. An SQL script (.sql file) was written to
import the attribute data and to join it to the geospatial data based on
country name. The PostGIS spatial database (LOD) was connected to GeoServer.
The table WorldCountries_LandLocked was then published as a new layer in the
WMS.
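
The join on country name can be sketched as follows. This is not the SQL script used in the experiments: SQLite from the Python standard library stands in for PostgreSQL/PostGIS so that the example runs without a database server, geometries are stored as plain WKT text, and all rows are placeholders. The `landlocked` flag is an assumption, derived here from whether a country appears in the imported attribute data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Spatial table: country names with (placeholder) geometries as WKT text.
CREATE TABLE world_countries (name TEXT PRIMARY KEY, geom TEXT);
-- Attribute table: rows imported from the CSV file of query results.
CREATE TABLE lod_attributes (name TEXT, density REAL);

INSERT INTO world_countries VALUES
    ('Country A', 'POLYGON((0 0, 1 0, 1 1, 0 0))'),
    ('Country B', 'POLYGON((2 2, 3 2, 3 3, 2 2))'),
    ('Country C', 'POLYGON((4 4, 5 4, 5 5, 4 4))');
INSERT INTO lod_attributes VALUES ('Country A', 12.5), ('Country B', 80.0);

-- Join attributes to geometries by country name; countries without a
-- match keep their geometry and receive NULL attribute values.
CREATE TABLE worldcountries_landlocked AS
SELECT c.name, c.geom, a.density,
       CASE WHEN a.name IS NULL THEN 0 ELSE 1 END AS landlocked
FROM world_countries AS c
LEFT JOIN lod_attributes AS a ON a.name = c.name;
""")

rows = conn.execute(
    "SELECT name, density, landlocked FROM worldcountries_landlocked ORDER BY name"
).fetchall()
print(rows)
```

Joining on names is fragile when the two sources spell a country differently, which is one reason data integration is listed among the main challenges.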



Figure 4. A GetMap response showing the WorldCountries_LandLocked layer. 

8 http://wiki.dbpedia.org/Interlinking. Accessed 22 March 2013



5.3. Creating Choropleth Maps 

A style (SLD file) was created to produce a choropleth map showing
landlocked and non-landlocked countries. Different colours were assigned to
polygons based on attribute data classification. Figure 5 is the resulting
choropleth map showing landlocked and non-landlocked countries after
applying the style to the WorldCountries_LandLocked layer. Another SLD file
was created and applied to the same WorldCountries_LandLocked layer. A new
choropleth map showing the population density (per square kilometre) of
landlocked countries resulted (as shown in Figure 6).
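
A choropleth style of this kind can be sketched as an SLD 1.0.0 document with one rule per class, each rule pairing a filter on the attribute with a polygon fill. The generator below is a minimal illustration, not the SLD files used in the experiments: the attribute name `density`, the class limits, the colours and the function name `choropleth_sld` are assumptions.

```python
import xml.etree.ElementTree as ET


def choropleth_sld(layer, attribute, classes):
    """Return an SLD 1.0.0 document with one rule per (lower, upper, colour)."""
    rules = []
    for lower, upper, colour in classes:
        rules.append(f"""
        <Rule>
          <Title>{lower} - {upper}</Title>
          <ogc:Filter>
            <ogc:PropertyIsBetween>
              <ogc:PropertyName>{attribute}</ogc:PropertyName>
              <ogc:LowerBoundary><ogc:Literal>{lower}</ogc:Literal></ogc:LowerBoundary>
              <ogc:UpperBoundary><ogc:Literal>{upper}</ogc:Literal></ogc:UpperBoundary>
            </ogc:PropertyIsBetween>
          </ogc:Filter>
          <PolygonSymbolizer>
            <Fill><CssParameter name="fill">{colour}</CssParameter></Fill>
          </PolygonSymbolizer>
        </Rule>""")
    return f"""<?xml version="1.0" encoding="UTF-8"?>
<StyledLayerDescriptor version="1.0.0"
    xmlns="http://www.opengis.net/sld"
    xmlns:ogc="http://www.opengis.net/ogc">
  <NamedLayer>
    <Name>{layer}</Name>
    <UserStyle>
      <FeatureTypeStyle>{''.join(rules)}
      </FeatureTypeStyle>
    </UserStyle>
  </NamedLayer>
</StyledLayerDescriptor>"""


sld = choropleth_sld("WorldCountries_LandLocked", "density",
                     [(0, 50, "#fee8c8"), (50, 150, "#e34a33")])
root = ET.fromstring(sld.encode("utf-8"))  # checks that the XML is well-formed
print(sld)
```

Because PropertyIsBetween includes both boundaries, a value equal to a shared class limit matches two rules; a production style would typically use half-open comparisons instead.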

Figure 5. Choropleth map showing landlocked and non-landlocked countries.

Figure 6. Choropleth map showing population density of landlocked countries.



5.4. Creating Proportional Symbols Map 

Another SPARQL query was formulated to retrieve the name of a country and
its nominal GDP per capita from DBpedia. In this example we applied
different sizes of a point-based symbol (circle) to distinguish between
different GDP per capita ranges: the size of the circle for a particular
class is proportional to the GDP per capita. Figure 7 shows the proportional
symbols map of nominal GDP per capita of countries in the world.
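
One common way to scale such symbols is to make the area of the circle, rather than its diameter, proportional to the data value, which gives the square-root relation d = d_max * sqrt(v / v_max). Whether area or diameter was scaled in our maps is not specified here, so the sketch below shows the area-based convention as an assumption; the maximum diameter and the sample values are placeholders.

```python
import math


def circle_diameter(value, max_value, max_diameter=40.0):
    """Diameter in pixels such that circle area is proportional to value."""
    if value <= 0 or max_value <= 0:
        return 0.0
    return max_diameter * math.sqrt(value / max_value)


# Placeholder class values (not real GDP figures).
for value in (100.0, 25.0, 0.0):
    print(value, circle_diameter(value, max_value=100.0))
```

With this scaling, a value one quarter of the maximum is drawn with half the maximum diameter, so its area is one quarter of the largest circle.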




Figure 7. Proportional symbols map showing nominal GDP per capita of countries 
in the world. 



6. Discussion 

Our research brings to the fore some of the challenges involved in
integrating non-spatial linked open data into existing web mapping tools.
Linked open data can be accessed through a SPARQL endpoint or an RDF dump.
These sources are not part of the typical open source web mapping
environment. Access, data conversion and data integration are some of the
main challenges in creating thematic maps with linked data on-the-fly, since
a SPARQL endpoint cannot be accessed directly from the web mapping
environment. Moreover, web map servers cannot consume RDF data directly.

In order to overcome these challenges, some middleware is required between
the web map server that hosts the geospatial data and the non-spatial linked
open data. Other challenges have to do with standardising data and creating
classes on-the-fly. With the current implementation of our geospatial
thematic web service, each thematic map requires a SPARQL query to be
formulated. From a user's perspective, formulating SPARQL queries could be a
daunting task. The next phase of our research is expected to provide
solutions so that users can access linked data without having to formulate
SPARQL queries themselves.



7. Conclusion 

The availability of large volumes of non-spatial linked data over the web 
presents an opportunity for geospatial web services. The integration of non- 
spatial linked data into an open source geospatial thematic web service to 
create thematic maps, as presented by this research, demonstrates a novel 
approach to take advantage of the enormous opportunities that the Web of 
data presents to the geospatial community. The results of our experiments 
show that it is possible to create thematic maps from linked open data in 
the linked open data cloud but that it is a cumbersome process. Access, data 
conversion and data integration are some of the main challenges in creating 
thematic maps with linked data on-the-fly from the web mapping environment.
In order to overcome these challenges there should be a bridge between
object-relational databases and linked open data.



8. Future Work 

We aim to automate the process so that from a single client request,
thematic maps can be created on-the-fly: consuming linked data (RDF and
GeoRDF), applying styling, publishing data to the web service and displaying
the thematic map to the client. For example, a client's request could be
converted directly to SPARQL and GeoSPARQL queries and the results of 
those queries processed by the geospatial thematic web service at the 
backend with styles and presented back to the client as a thematic map. 